A New Approach to Online Generation of Association Rules
Abstract
We discuss the problem of online mining of association rules in a large database of sales transactions. Online mining is made possible by preprocessing the data so that it is suitable for repeated online queries. We store the preprocessed data in such a way that online processing can be done by applying a graph theoretic search algorithm whose complexity is proportional to the size of the output. The result is an online algorithm whose running time is independent of both the size of the transactional data and the size of the preprocessed data. The algorithm is almost instantaneous in the size of the output. The algorithm also supports techniques for quickly discovering association rules from large itemsets, and it is capable of finding rules with specific items in the antecedent or consequent. These association rules are presented in a compact form, eliminating redundancy. The use of nonredundant association rules helps significantly in reducing irrelevant noise in the data mining process.
Index Terms: OLAP, association rules, data mining, knowledge discovery.
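The abstract does not give the data structure or the search procedure, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes the offline step has stored a support value for every itemset above some base threshold (and for all of its subsets), links each stored itemset to its one-item extensions, and answers a query by searching that graph from the requested items. The names `SUPPORT`, `build_lattice`, `itemsets_containing`, and `rules_from`, and all support values, are hypothetical.

```python
from itertools import combinations

# Hypothetical preprocessed data: support (fraction of transactions) for each
# itemset kept by the offline step. Values are made up for illustration.
SUPPORT = {
    frozenset(): 1.0,
    frozenset({"bread"}): 0.6,
    frozenset({"milk"}): 0.5,
    frozenset({"eggs"}): 0.4,
    frozenset({"bread", "milk"}): 0.4,
    frozenset({"bread", "eggs"}): 0.3,
    frozenset({"milk", "eggs"}): 0.2,
    frozenset({"bread", "milk", "eggs"}): 0.2,
}


def build_lattice(support):
    """Link each stored itemset to the stored itemsets that add one item."""
    children = {itemset: [] for itemset in support}
    for itemset in support:
        for item in itemset:
            parent = itemset - {item}
            if parent in children:
                children[parent].append(itemset)
    return children


def itemsets_containing(children, support, required, min_support):
    """Depth-first search from `required`, pruning below `min_support`.

    Support never increases when items are added, so a pruned child cannot
    have a qualifying descendant; the search only expands itemsets that
    belong to the answer.
    """
    start = frozenset(required)
    if support.get(start, 0.0) < min_support:
        return []
    found, stack, seen = [], [start], {start}
    while stack:
        node = stack.pop()
        found.append(node)
        for child in children[node]:
            if child not in seen and support[child] >= min_support:
                seen.add(child)
                stack.append(child)
    return found


def rules_from(itemset, support, min_confidence):
    """List (antecedent, consequent, confidence) rules for one itemset.

    Assumes every non-empty proper subset of `itemset` is present in `support`.
    """
    rules = []
    for size in range(1, len(itemset)):
        for antecedent in map(frozenset, combinations(sorted(itemset), size)):
            confidence = support[itemset] / support[antecedent]
            if confidence >= min_confidence:
                rules.append((antecedent, itemset - antecedent, confidence))
    return rules


LATTICE = build_lattice(SUPPORT)
hits = itemsets_containing(LATTICE, SUPPORT, {"bread"}, min_support=0.3)
```

In this sketch, a query such as `itemsets_containing(LATTICE, SUPPORT, {"bread"}, min_support=0.3)` never scans the transactions, and `rules_from` can then be applied to each returned itemset with a chosen confidence threshold. Redundancy elimination, which the abstract mentions, is not attempted here.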
Similar resources
A new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining
Data envelopment analysis (DEA) is a relatively new data oriented approach to evaluate performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relative limited period, DEA has been converted into a strong quantitative and analytical tool to measure and evaluate performance. In an article written by Toloo et al. (2009...
Retaining Customers Using Clustering and Association Rules in Insurance Industry: A Case Study
This study clusters customers and finds the characteristics of different groups in a life insurance company in order to find a way for prediction of customer behavior based on payment. The approach is to use clustering and association rules based on CRISP-DM methodology in data mining. The researcher could classify customers of each policy in three different clusters, using association rules. A...
Voltage Control Approach in Smart Distribution Network with Renewable Distributed Generation
Voltage control is one of the imperative issues in the smart distribution control system. While traditional distribution network is equipped with communication and monitoring equipment, the online voltage control can be perfectly achieved. With using these smart grid technologies, the distribution voltage control schemes should carry out intelligently and cover the undesirable effect of high pe...
Online Judgment in the Context of International and National Rules: Ethical and Legal Challenges
Background: Online judgment is an economical and faster way than the judicial one. With the development of technology in recent decades, it has also been possible to make judgments online. Although few countries have incorporated this approach into their laws, online judgments are being developed and implemented in various areas such as international trade or intellectual property. The pres...
An Analysis of Circulation of Decentralized Digital Money in Quantum Electrodynamics Space: the Econphysics Approach
The study aimed at showing how to create and release cryptocurrency, based on which one can introduce a new generation of this money that can continue its life in the quantum computers space and study whether cryptocurrency could be controlled or the rules should be rewritten in line with new technology. Regarding this, we showed the evolution of money and its uses in economic relations. Accord...
New Approaches to Analyze Gasoline Rationing
In this paper, the relation among factors in the road transportation sector from March, 2005 to March, 2011 is analyzed. Most of the previous studies have economical point of view on gasoline consumption. Here, a new approach is proposed in which different data mining techniques are used to extract meaningful relations between the aforementioned factors. The main and dependent factor is gasolin...
Journal title:
- IEEE Trans. Knowl. Data Eng.
Volume 13
Publication date: 2001